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) GENERAL INFORMATION: 



(i) 



APPLICANT : 



SEQUENCE LISTING 



(A) NAME: LONZA AG 

(B) STREET: Muenchensteinerstrasse 38 

(C) CITY: Basle 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE: 4002 

(ii) TITLE OF INVENTION: Process for the 

preparation of (S)-or (R) -3,3, 3-trif luoro-2- 
hydroxy-2-methylpropionic acid 

(iii) NUMBER OF SEQUENCES: 14 

(iv) COMPUTER-READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(c) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version 
#1.30 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 base pairs 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: Genomic DNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(vi) ORIGIN: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ ISOLATE: PRS1 

(vii) PROVENANCE: 

(B) CLONE (S) : pPRS2a 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join ( 197 .. 1181 ) 

(D) OTHER INFORMATION: /product = "amidase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
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CCCGGGAACT CCATGTGGCC GTGATCCTGG TCGAGCAGGA TATTGCGATG ATCCAGCGGG €0 

CCCCACAGCG CTGTGCGGTA ATGGATAAAG GCCTGGTTGT AGAAACGCTG ACCCAACAAC 120 

AGCTCTCTGA TGAT CTTTT A ATGCGTCGTC ATCTGGCTCT GTAACTAAAC GCTATAAATT 180 

ACGTGGAGAA TAACAT ATG AAA TGG TTG GAA GAA TCC ATT ATG GCC AAA 229 
Met Lys Tro Leu Glu Glu Ser lie Met Ala Lys 
I S 10 
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CGC GGT GTT GGT GCC GGG CCT AAA CCG GTA ACG CAT CAC CTC ACG GAA 277 
Arg Gly Val Gly Ala Gly Arg Lys Pro Val Thr His His Leu tut giu 

15 2° 2 

GAA ATG CAA AAA GAG TTT CAT TACACC ATT GGC CCT TAT TCC ACA CCC 325 
Glu Hec Gin Lys Glu Phe His Tyr Thr He Gly Pro Tyr Ser Thr Pro 
30 35 40 

GTC CTG ACC ATC GAA CCC GGT GAC CGG ATT ATT GTC GAC ACT CGA GAT 373 
Val Leu Thr He Glu Pro Gly Asp Arg He He Val Asp Tnr Arg Asp 
45 50 55 

GCT TTT GAA GGT GCT ATC AAT TCG GAA CAG GAT ATT CCG AGC CAG TTG 421 
Ala Phe Glu Gly Ala He Asn Ser Glu Gin Asp He Pro ser Gin wu 
60 65 70 ,s 

CTA AAA ATG CCC TTT CTC AAC CCA CAA AAC GGA CCG ATC ATG GTC AAT 469 
Leu Lys Met Pro Phe Leu Asn Pro Gin Asn Gly Pro He Met Val Asn 
80 85 

GGC GCG GAG AAA GGT GAT GTC CTC GCT GTC TAT ATC GAA TCC ATC TTG 517 
Gly Ala Glu Lys Gly Asp Val Leu Ala val Tyr He Glu Ser Met Leu 
1 95 100 105 

CCC CGC GGC GTT GAT CCC TAC GGC ATC TGC GCC ATC ATT CCG CAT TTT 565 
Pro Arg Gly Val Asp Pro Tyr Gly He Cys Ala Met He Pro His Phe 
110 US 120 

GGC GGA CTG ACC GGG ACC GAC CTG ACG GCC ATC CTC AAT GAT CCG CTG 613 
Gly Gly Leu Thr Gly Thr Asp Leu Thr Ala Met Leu Asn Asp Pro Leu 
125 130 135 

CCA GAA AAG GTG CGC ATG ATT AAA CTC GAC AGT GAA AAG GTC TAC TGG 661 
£o Slu Lys Val Arg Met He Lys Leu Asp Ser Glu Lys Val Tyr Trp 
140 145 150 x " 

AGC AAA CGC CAT ACG CTT CCC TAT AAA CCC CAT ATT GGC ACC TTG AGC 709 
Ser Lys Arg His Thr Leu Pro Tyr Lys Pro His He Gly Thr Leu Ser 

165 a / u 



160 



GTA TCG CCA GAA ATT GAC TCA ATC AAT TCA CTG ACG CCA GAC AAT CAC 757 
Val Ser Pro Glu lie Asp Ser lie Asn Ser Leu Thr Pro Asp Asn His 

180 1» 5 



175 



GGC GGG AAT ATG GAT GTG CCG GAT ATA GGA CCA GGG AGT ATT ACC TAT 805 
Gly Gly Asn Met Asp Val Pro Asp lie Gly Pro Gly Ser lie Thr Tyr 
190 195 200 



CTG CCG GTA CGT GCG CCT GGA GGC CGC CTG TTT ATT GGT GAT GCC CAT 853 
Leu Pro Val Arg Ala Pro Gly Gly Arg Leu Phe lie Gly Asp Ala His 
205 210 215 

GCT TGT CAG GGT GAT GGT GAG ATT TGC GGG ACC GCA GTA GAG TTT GCC 901 
Ala Cys Gin Gly Asp Gly Glu He Cys Gly Thr Ala Val Glu Phe Ala 
220 225 230 235 

TCA ATC ACC ACC ATC AAA GTC GAT TTG ATC AAG AAC TGG CAG CTT TCC 949 
Ser He Thr Thr He Lys Val Asp Leu lie Lys Asn Trp Gin Leu Ser 
240 245 250 

TGG CCA CGA ATG GAG AAT GCC GAA AAT ATT ATG AGT ATT GGC AGT GCA 997 
Trp Pro Arg Met Glu Asn Ala Glu Asn He Met Ser He Gly Ser Ala 
255 260 265 
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CGT CCG CTG GAG GAT GCG ACG CGA ATT CCA TAT CGC GAC TTA ATT TAC 1045 
Arg Pro Leu Glu Asp Ala Thr Arg He Ala Tyr Arg Asp Leu He Tyr 
270 275 280 

TGG CTG GTA GAA GAC TTT GGC TTC GAA CAA TGG GAT GCC TAC ATG CTT 1093 
Trp Leu Val Glu Asp Phe Gly Phe Glu Gin Trp Asp Ala Tyr Met Leu 
285 290 295 

CTG AGT CAA TGC GGC AAA GTG CGG CTG GGC AAC ATG GTC GAC CCC AAA 1141 
Leu Ser Gin Cys Gly Lys Val Arg Leu Gly Asn Met Val Asp Pro Lys 
300 305 310 315 

TAC ACC GTT GGC GCG ATG CTG AAC AAA AAC CTG TTA GTT TAGTAGGAAT 1190 
Tyr Thr Val Gly Ala Met Leu Asn Lys Asn Leu Leu Val 
320 325 

AACTAACCGG TGAACATTAC CCGGATGTAG ATCGGGGTAA TGTGTAAGTT CAAACAATCG 1250 

CTATTTTTAA CAGCTAAAGC AGGTGCATAT GGGGCCAGAT ACACCCATCA ATATTGGTTT 1310 

ACTTTACTCC TTCAGCGGAG TGACGGCGGC ACAAGAGTTG TCACAATGGC GCGGAGCAAC 1370 

CCAGGCTATT GCCGAAATTA ATCAAAATGG CGGCATCAAC GGCAGACCAC TCAATGCAAT 1430 

TCATTTGGAT CC 1442 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Trp Leu Glu Glu Ser He Met Ala Lys Arg Gly Val Gly Ala 
1 5 10 * 15 

Gly Arg Lys Pro Val Thr His His Leu Thr Glu Glu Met Gin Lys Glu 
20 25 30 

Phe His Tyr Thr He Gly Pro Tyr Ser- Thr Pro Val Leu Thr He Glu 
35 40 45 

Pro Gly Asp Arg He He Val Asp Thr Arg Asp Ala Phe Glu Gly Ala 
50 55 60 

He Asn Ser Glu Gin Asp He Pro Ser Gin Leu Leu Lys Met Pro Phe 
65 70 75 80 

Leu Asn Pro Gin Asn Gly Pro He Met Val Asn Gly Ala Glu Lys Gly 
65 90 95 

Asp Val Leu Ala Val Tyr He Glu Ser Met Leu Pro Arg Gly Val Asp 
100 105 110 

Pro Tyr Gly He Cys Ala Met lie Pro* His Phe Gly Gly Leu Thr Gly 
115 * 120 125 

Thr Asp Leu Thr Ala Met Leu Asn Asp Pro Leu Pro Glu Lys Val Arg 
130 135 140 
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Met lie Lys Leu Asp Ser Glu Lys Val Tyr Trp Ser Lys Arg His Thr 
145 150 155 160 

Leu Pro Tyr Lys Pro His He Gly Thr Leu Ser Val Ser Pro Glu He 
165 170 175 

Asp Ser He Asn Ser Leu Thr Pro Asp Asn His Gly Gly Asn Met Asp 
180 185 190 

Val Pro Asp He Gly Pro Gly Ser lie Thr Tyr Leu Pro Val Arg Ala 
195 200 205 

Pro Gly Gly Arg Leu Phe He Gly Asp Ala His Ala Cys Gin Gly Asp 
210 ~ 215 22Z 

Gly Glu He Cys Gly Thr Ala Val Glu Phe Ala Ser He Thr Thr He 
225 230 235 240 

Lys Val Asp Leu He Lys Asn Trp Gin Leu Ser Trp Pro Arg Met Glu 
245 250 255 

Asn Ala Glu Asn He Met Ser He Gly Ser Ala Arg Pro Leu Glu Asp 
260 265 270 

Ala Thr Arg He Ala Tyr Arg Asp Leu lie Tyr Trr Leu Val Glu Asp 
275 280 285 

Phe Gly Phe Glu Gin Trp Asp Ala Tyr Met Leu Leu Ser Gin Cys Gly 
290 295 3C: 

Lys Val Arg Leu Gly Asn Met Val Asp Pro Lys Tyr Thr Val Gly Ala 
305 310 315 320 

Met Leu Asn Lvs Asn Leu Leu Val 
325 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not known 

(D) TOPOLOGY: not known 



(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL /ISOLATE: PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Lys Trp Leu Glu Glu Ser He Met Ala Lys Arg Gly Val Gly Ala 
15 10 15 

Ser Arg Lys Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 
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(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ ISOLATE: PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Val Tyr Trp Ser Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL /ISOLATE: PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Lys Pro Val Thr His His Leu Thr Glu Glu Met Gin Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL /ISOLATE: PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Tyr Thr Val Gly Ala Met Leu Asn Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 
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(D) TOPOLOGY: not known 
(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL /ISOLATE: PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Glu Asn Ala Glu Asn He Met Ser He Gly Ser Ala Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL /ISOLATE: PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Trp Leu Glu Glu Ser lie Met Ala Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL /ISOLATE: PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Pro Phe Leu Asn Pro Gin Asn Gly Pro He Met Val Asn Gly Ala 
15 10 15 

Glu Lys 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL /ISOLATE: PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Ala Phe Glu Gly Ala lie Asn Ser Glu Gin Asp He Pro Ser Gin 
1 5 10 15 

Leu Leu Lys 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 



(ii) MOLECULE TYPE: peptide 



(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL /ISOLATE: PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
Glu Phe His Tvr Thr He Glv Pro Tyr Ser Thr Pro Val Leu Thr lie 
1.5 10 15 

Glu Pro Gly Asp Arc 
20 

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ ISOLATE: PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
Leu Phe He Gly Asp Ala His Ala Glu Gin Gly Asp Gly Glu He Glu 



10 



15 



Gly Thr Ala Val Glu Phe Ala 
20 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL /ISOLATE: PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gly Asp Val Leu Ala Val Tyr lie Glu Ser Met Leu Pro Arc 
1 S 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(C) INDIVIDUAL /ISOLATE: PRS1 

( vi i ) PROVENANCE : 

(B) CLONE (S) : PRS1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Gly Val Asp Pro Tyr Gly lie Glu Ala Met lie Pro His Phe C-* v Glv 
1 5 10 - =' 

Leu Thr Gly Thr Asp Leu Thr Ala Met Leu Asn Asd Gin Leu G* n P~o 
20 25 30 

Lys 



